GPU Timing, Synchronization, Stream Dependencies, Performance Measurement

A hitchhiker's guide to CUDA programming
seanzhang.me·1d·
Discuss: Hacker News
🎯GPU Kernels
Flag this post
A portable picokernel for async I/O
ryansepassi.com·16h·
Discuss: Hacker News
📊Profiling Tools
Flag this post
Get Ready for Clojure, GPU, and AI in 2026 with CUDA 13.0
dragan.rocks·2d·
Discuss: Hacker News
🌊CUDA Streams
Flag this post
Nvidia GPU Boost: My Stock RTX 5080 Is Consistently Beating Advertised
news.ycombinator.com·2h·
Discuss: Hacker News
📈GPU Occupancy
Flag this post
Bringing Ideas to Life with 3D Design and Smart Performance Tools
bottleneckscalculators.com·1d·
Discuss: DEV
📈Occupancy Optimization
Flag this post
Strix Halo's Memory Subsystem: Tackling iGPU Challenges
chipsandcheese.com·21h·
Discuss: Hacker News
📈GPU Occupancy
Flag this post
Cycle-accurate 6502 emulator as coroutine in Rust
github.com·4h·
Discuss: Hacker News
📊Profiling Tools
Flag this post
Your GPU isn't hitting 100% utilization, and that's completely fine
xda-developers.com·8h
📈GPU Occupancy
Flag this post
Where to Buy or Rent GPUs for LLM Inference: The 2026 GPU Procurement Guide
bentoml.com·1d·
Discuss: Hacker News
🎯GPU Kernels
Flag this post
Hyper-Dimensional Bayesian Optimization for Enhanced Statistical Process Control
dev.to·6h·
Discuss: DEV
⏱️Benchmarking
Flag this post
DGX Spark UMA can trick you
bartusiak.ai·1d·
Discuss: Hacker News
🎯GPU Kernels
Flag this post
Microstutter in games? Your RGB software might be why
howtogeek.com·3h
📈Occupancy Optimization
Flag this post
Challenging the Fastest OSS Workflow Engine
obeli.sk·1d·
🔧PTX
Flag this post
Engineering Driver Reignites Battlemage B770 GPU Speculation
techpowerup.com·23h
🔧PTX
Flag this post
Feature Infrastructure Engineering: A Comprehensive Guide
mlfrontiers.substack.com·3h·
Discuss: Substack
ONNX Runtime
Flag this post
Red Hat Catches CUDA Train at NVIDIA GTC, Adds AI-Ready Security and DPU Support
lxer.com·1d
🎯GPU Kernels
Flag this post
4x RTX 3090 Setup for Wan2.2-TI2V-5B (FP16)
textimage2video.py·3d·
Discuss: r/LocalLLaMA
📈GPU Occupancy
Flag this post
In-DRAM TRNG Using Simultaneous Multiple-Row Activation (ETH Zurich, CISPA)
semiengineering.com·1d
📊Profiling Tools
Flag this post
The Hidden Ledger of Code: Tracking the Carbon Debt Inside Our Software
hackernoon.com·5h
🏗️Build Optimization
Flag this post